popSTR: population-scale detection of STR variants

Authors

  • Snædís Kristmundsdóttir
  • Brynja D. Sigurpálsdóttir
  • Birte Kehr
  • Bjarni V. Halldórsson
Abstract

Motivation: Microsatellites, also known as short tandem repeats (STRs), are tracts of repetitive DNA sequence containing motifs ranging from two to six bases. Microsatellites are one of the most abundant types of variation in the human genome, after single nucleotide polymorphisms (SNPs) and indels. Microsatellite analysis has a wide range of applications, including medical genetics, forensics and the construction of genetic genealogies. However, microsatellite variations are rarely considered in whole-genome sequencing (WGS) studies, in large part due to a lack of tools capable of analyzing them.

Results: Here we present a microsatellite genotyper, optimized for Illumina WGS data, which is both faster and more accurate than previously presented methods. There are two main ingredients to our improvements. First, we reduce the amount of sequencing data necessary for creating microsatellite profiles by using previously aligned sequencing data. Second, we use population information to train microsatellite-specific and individual-specific error profiles. By comparing our genotyping results to genotypes generated by capillary electrophoresis, we show that our error rates are 50% lower than those of lobSTR, another program specifically developed to determine microsatellite genotypes.

Availability and Implementation: Source code is available on GitHub: https://github.com/DecodeGenetics/popSTR.

Contact: [email protected] or [email protected].

Similar resources

Assessment of the informativeness of the D9S1876 marker located in the TMC1 gene region in the Iranian population

Background and Objective: TMC1 gene mutations are known as the most common causes of autosomal recessive non-syndromic hearing loss (ARNSHL) in different populations. Given the large size of the TMC1 gene and the large number of mutations identified in it, the use of polymorphic markers is suggested for carrier detection and prenatal diagnosis in families. In this study, informati...

Genetic Variation of Informative Short Tandem Repeat (STR) Loci in an Iranian Population

In the present study, genotyping of six short tandem repeat (STR) loci including CSF1PO, D16S539, F13A01, F13B, LPL and HPRTB was performed on genomic DNA from 127 unrelated individuals from the Iranian province of Isfahan. The results indicated that the allele and genotype distributions were in accordance with Hardy-Weinberg expectations. The observed heterozygosity (Ho), expected heterozygosi...

Genotyping of Five Polymorphic STR Loci in Iranian Province of Isfahan

Genotyping for five short tandem repeat (STR) loci HUMvWA, HUMFES, HUMTPO, HUMTH01 and D3S1359 was done in 220 unrelated individuals from the population of Isfahan province of IR Iran. The loci were genotyped using the polymerase chain reaction (PCR) followed by polyacrylamide gel electrophoresis (PAGE) and silver staining. The data demonstrated that the STR markers were all found informative i...

Validation of quantitative fluorescent-PCR for rapid prenatal diagnosis of common aneuploidies in the Chinese population.

Quantitative fluorescent polymerase chain reaction (QF-PCR) is an accurate and reliable method for rapid detection of aneuploidy; however, it is not routinely used in China. We aimed to validate QF-PCR as a means for prenatal common aneuploidy screening and to analyze the heterozygosities of short tandem repeat (STR) markers in the Chinese population. The sequences of 19 STR markers in chromoso...

Genetic analysis of two STR loci (VWA and TPOX) in the Iranian province of Khuzestan

Objective(s): Short tandem repeat (STR) loci are the most informative DNA genetic markers for attempting to individualize biological material for application in paternity and forensic cases. Materials and Methods: Blood samples were collected and the total genomic DNA was extracted. The DNA samples were used for genotyping VWA and TPOX STR loci using PCR and polyacrylamide gel electrophoresis. ...

Journal:
  • Bioinformatics

Volume 33, Issue 24

Pages: -

Publication date: 2017